AITopics | Des Moines

Collaborating Authors

Des Moines

Blending 3D Geometry and Machine Learning for Multi-View Stereopsis

Vats, Vibhas, Reza, Md. Alimoor, Crandall, David, Jung, Soon-heung

arXiv.org Artificial IntelligenceSep-16-2025

Traditional multi-view stereo (MVS) methods primarily depend on photometric and geometric consistency constraints. In contrast, modern learning-based algorithms often rely on the plane sweep algorithm to infer 3D geometry, applying explicit geometric consistency (GC) checks only as a post-processing step, with no impact on the learning process itself. In this work, we introduce GC MVSNet plus plus, a novel approach that actively enforces geometric consistency of reference view depth maps across multiple source views (multi view) and at various scales (multi scale) during the learning phase (see Fig. 1). This integrated GC check significantly accelerates the learning process by directly penalizing geometrically inconsistent pixels, effectively halving the number of training iterations compared to other MVS methods. Furthermore, we introduce a densely connected cost regularization network with two distinct block designs simple and feature dense optimized to harness dense feature connections for enhanced regularization. Extensive experiments demonstrate that our approach achieves a new state of the art on the DTU and BlendedMVS datasets and secures second place on the Tanks and Temples benchmark. To our knowledge, GC MVSNet plus plus is the first method to enforce multi-view, multi-scale supervised geometric consistency during learning. Our code is available.

artificial intelligence, computer vision, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2505.0347

Country:

North America > United States > Iowa > Polk County > Des Moines (0.04)
North America > United States > Indiana > Monroe County > Bloomington (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(3 more...)

Genre: Research Report > Promising Solution (0.87)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.96)

Add feedback

Exploring psychophysiological methods for human-robot collaboration in construction

Wong, Saika, Chen, Zhentao, Pan, Mi, Skibniewski, Miroslaw J.

arXiv.org Artificial IntelligenceMar-21-2025

Human-robot collaboration (HRC) refers to scenarios Various psychophysiological-based methods have in which humans and robots work collaboratively toward a been employed to interpret psychological phenomena within common goal, sharing tasks and responsibilities in a way the context of HRC by measuring the brain and physiological that capitalizes on the strengths of both parties [3]. As activity of workers, such as electroencephalography construction tasks become increasingly complex and timesensitive, (EEG) for brain activity [73], photoplethysmography (PPG), the integration of collaborative robots, or cobots, electrocardiography (ECG) for cardiac activity [7], and into the construction industry has emerged as a solution to electrodermal activity (EDA) for skin response [8]. Given all enhance efficiency and simultaneously mitigate operational the merits of these technologies, some initial endeavors on risks [86, 90]. However, real-world deployment of HRC psychophysiological methods for HRC in construction have in construction confronts multifaceted challenges, such as been made. For instance, real-time feedback from individual's trust in robotic capabilities [21], frequent reconfigurations physiological responses [21] and cognitive load [50] of working conditions [43], and communication in noisy has been used to allow cobots to adjust their behavior (e.g., and unstructured environments [24]. These challenges are accelerate, stop, slow down) in response to the changing exacerbated by the reliability and safety issues inherent in workers' conditions. However, studies on wearable-based complicated and dynamic construction activities and environments psychophysiological methods for the construction industry (e.g., human dynamics, non-deterministic features, to date are still limited and embryonic, primarily focusing and the presence of various materials) [49, 50]. To address on interpreting a specific dimension of worker status. While these limitations, the development of HRC is shifting these methods hold promise for advancing human-centric from performance-oriented approaches to human-centrality robot collaboration in construction, their potential has not yet paradigms, emphasizing a comprehensive interpretation of been fully explored, and current applications remain largely collaborative behaviors between humans and their robot experimental.

artificial intelligence, data quality, machine learning, (21 more...)

arXiv.org Artificial Intelligence

2503.17078

Country:

Asia > Macao (0.14)
North America > United States > Iowa > Polk County > Des Moines (0.04)
North America > United States > Virginia > Arlington County > Arlington (0.04)
(6 more...)

Genre:

Questionnaire & Opinion Survey (1.00)
Research Report > Experimental Study (0.93)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.68)
(3 more...)

Add feedback

Logic-RAG: Augmenting Large Multimodal Models with Visual-Spatial Knowledge for Road Scene Understanding

Kabir, Imran, Reza, Md Alimoor, Billah, Syed

arXiv.org Artificial IntelligenceMar-16-2025

Large multimodal models (LMMs) are increasingly integrated into autonomous driving systems for user interaction. However, their limitations in fine-grained spatial reasoning pose challenges for system interpretability and user trust. We introduce Logic-RAG, a novel Retrieval-Augmented Generation (RAG) framework that improves LMMs' spatial understanding in driving scenarios. Logic-RAG constructs a dynamic knowledge base (KB) about object-object relationships in first-order logic (FOL) using a perception module, a query-to-logic embedder, and a logical inference engine. We evaluated Logic-RAG on visual-spatial queries using both synthetic and real-world driving videos. When using popular LMMs (GPT-4V, Claude 3.5) as proxies for an autonomous driving system, these models achieved only 55% accuracy on synthetic driving scenes and under 75% on real-world driving scenes. Augmenting them with Logic-RAG increased their accuracies to over 80% and 90%, respectively. An ablation study showed that even without logical inference, the fact-based context constructed by Logic-RAG alone improved accuracy by 15%. Logic-RAG is extensible: it allows seamless replacement of individual components with improved versions and enables domain experts to compose new knowledge in both FOL and natural language. In sum, Logic-RAG addresses critical spatial reasoning deficiencies in LMMs for autonomous driving applications. Code and data are available at https://github.com/Imran2205/LogicRAG.

large language model, logic & formal reasoning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2503.12663

Country:

North America > United States > Pennsylvania > Centre County > State College (0.04)
North America > United States > Iowa > Polk County > Des Moines (0.04)
Asia > Singapore (0.04)
Asia > Indonesia > Bali (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Transportation (0.90)
Information Technology (0.75)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)
(2 more...)

Add feedback

Data-driven Super-Resolution of Flood Inundation Maps using Synthetic Simulations

Aravamudan, Akshay, Rasheed, Zimeena, Zhang, Xi, Scarpignato, Kira E., Nikolopoulos, Efthymios I., Krajewski, Witold F., Anagnostopoulos, Georgios C.

arXiv.org Artificial IntelligenceFeb-14-2025

The frequency of extreme flood events is increasing throughout the world. Daily, high-resolution (30m) Flood Inundation Maps (FIM) observed from space play a key role in informing mitigation and preparedness efforts to counter these extreme events. However, the temporal frequency of publicly available high-resolution FIMs, e.g., from Landsat, is at the order of two weeks thus limiting the effective monitoring of flood inundation dynamics. Conversely, global, low-resolution (~300m) Water Fraction Maps (WFM) are publicly available from NOAA VIIRS daily. Motivated by the recent successes of deep learning methods for single image super-resolution, we explore the effectiveness and limitations of similar data-driven approaches to downscaling low-resolution WFMs to high-resolution FIMs. To overcome the scarcity of high-resolution FIMs, we train our models with high-quality synthetic data obtained through physics-based simulations. We evaluate our models on real-world data from flood events in the state of Iowa. The study indicates that data-driven approaches exhibit superior reconstruction accuracy over non-data-driven alternatives and that the use of synthetic data is a viable proxy for training purposes. Additionally, we show that our trained models can exhibit superior zero-shot performance when transferred to regions with hydroclimatological similarity to the U.S. Midwest.

artificial intelligence, fim, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2502.10601

Country:

Africa > Ghana (0.05)
Europe > Western Europe (0.04)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Research Report (0.64)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)

Add feedback

Autonomous Building Cyber-Physical Systems Using Decentralized Autonomous Organizations, Digital Twins, and Large Language Model

Ly, Reachsak, Shojaei, Alireza

arXiv.org Artificial IntelligenceOct-24-2024

Current autonomous building research primarily focuses on energy efficiency and automation. While traditional artificial intelligence has advanced autonomous building research, it often relies on predefined rules and struggles to adapt to complex, evolving building operations. Moreover, the centralized organizational structures of facilities management hinder transparency in decision-making, limiting true building autonomy. Research on decentralized governance and adaptive building infrastructure, which could overcome these challenges, remains relatively unexplored. This paper addresses these limitations by introducing a novel Decentralized Autonomous Building Cyber-Physical System framework that integrates Decentralized Autonomous Organizations, Large Language Models, and digital twins to create a smart, self-managed, operational, and financially autonomous building infrastructure. This study develops a full-stack decentralized application to facilitate decentralized governance of building infrastructure. An LLM-based artificial intelligence assistant is developed to provide intuitive human-building interaction for blockchain and building operation management-related tasks and enable autonomous building operation. Six real-world scenarios were tested to evaluate the autonomous building system's workability, including building revenue and expense management, AI-assisted facility control, and autonomous adjustment of building systems. Results indicate that the prototype successfully executes these operations, confirming the framework's suitability for developing building infrastructure with decentralized governance and autonomous operation.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2410.19262

Country:

Asia > India (0.04)
North America > United States > Virginia > Montgomery County > Blacksburg (0.04)
North America > United States > Iowa > Polk County > Des Moines (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Energy (1.00)
Construction & Engineering (1.00)
Banking & Finance > Trading (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Democratizing Signal Processing and Machine Learning: Math Learning Equity for Elementary and Middle School Students

Vaswani, Namrata, Selim, Mohamed Y., Gibert, Renee Serrell

arXiv.org Artificial IntelligenceSep-25-2024

Signal Processing (SP) and Machine Learning (ML) rely on good math and coding knowledge, in particular, linear algebra, probability, and complex numbers. A good grasp of these relies on scalar algebra learned in middle school. The ability to understand and use scalar algebra well, in turn, relies on a good foundation in basic arithmetic. Because of various systemic barriers, many students are not able to build a strong foundation in arithmetic in elementary school. This leads them to struggle with algebra and everything after that. Since math learning is cumulative, the gap between those without a strong early foundation and everyone else keeps increasing over the school years and becomes difficult to fill in college. In this article we discuss how SP faculty and graduate students can play an important role in starting, and participating in, university-run (or other) out-of-school math support programs to supplement students' learning. Two example programs run by the authors (CyMath at ISU and Ab7G at Purdue) are briefly described. The second goal of this article is to use our perspective as SP, and engineering, educators who have seen the long-term impact of elementary school math teaching policies, to provide some simple almost zero cost suggestions that elementary schools could adopt to improve math learning: (i) more math practice in school, (ii) send small amounts of homework (individual work is critical in math), and (iii) parent awareness (math resources, need for early math foundation, clear in-school test information and sharing of feedback from the tests). In summary, good early math support (in school and through out-of-school programs) can help make SP and ML more accessible.

artificial intelligence, machine learning, student, (16 more...)

arXiv.org Artificial Intelligence

2409.17304

Country:

North America > United States > Indiana (0.05)
North America > United States > Virginia (0.04)
North America > United States > Mississippi (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Education > Educational Setting > K-12 Education > Middle School (1.00)
Education > Curriculum > Subject-Specific Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (0.35)

Add feedback

X-ray Fluoroscopy Guided Localization and Steering of Medical Microrobots through Virtual Enhancement

Alabay, Husnu Halid, Le, Tuan-Anh, Ceylan, Hakan

arXiv.org Artificial IntelligenceSep-12-2024

In developing medical interventions using untethered milli- and microrobots, ensuring safety and effectiveness relies on robust methods for detection, real-time tracking, and precise localization within the body. However, the inherent non-transparency of the human body poses a significant obstacle, limiting robot detection primarily to specialized imaging systems such as X-ray fluoroscopy, which often lack crucial anatomical details. Consequently, the robot operator (human or machine) would encounter severe challenges in accurately determining the location of the robot and steering its motion. This study explores the feasibility of circumventing this challenge by creating a simulation environment that contains the precise digital replica (virtual twin) of a model microrobot operational workspace. Synchronizing coordinate systems between the virtual and real worlds and continuously integrating microrobot position data from the image stream into the virtual twin allows the microrobot operator to control navigation in the virtual world. We validate this concept by demonstrating the tracking and steering of a mobile magnetic robot in confined phantoms with high temporal resolution (< 100 ms, with an average of ~20 ms) visual feedback. Additionally, our object detection-based localization approach offers the potential to reduce overall patient exposure to X-ray doses during continuous microrobot tracking without compromising tracking accuracy. Ultimately, we address a critical gap in developing image-guided remote interventions with untethered medical microrobots, particularly for near-future applications in animal models and human patients.

interface, intervention, microrobot, (16 more...)

arXiv.org Artificial Intelligence

2409.08337

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > Massachusetts > Middlesex County > Somerville (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.47)

Add feedback

Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval

Valluri, Ravisri, Mohankumar, Akash Kumar, Dave, Kushal, Singh, Amit, Jiao, Jian, Varma, Manik, Sinha, Gaurav

arXiv.org Artificial IntelligenceJun-10-2024

Generative Retrieval introduces a new approach to Information Retrieval by reframing it as a constrained generation task, leveraging recent advancements in Autoregressive (AR) language models. However, AR-based Generative Retrieval methods suffer from high inference latency and cost compared to traditional dense retrieval techniques, limiting their practical applicability. This paper investigates fully Non-autoregressive (NAR) language models as a more efficient alternative for generative retrieval. While standard NAR models alleviate latency and cost concerns, they exhibit a significant drop in retrieval performance (compared to AR models) due to their inability to capture dependencies between target tokens. To address this, we question the conventional choice of limiting the target token space to solely words or sub-words. We propose PIXAR, a novel approach that expands the target vocabulary of NAR models to include multi-word entities and common phrases (up to 5 million tokens), thereby reducing token dependencies. PIXAR employs inference optimization strategies to maintain low inference latency despite the significantly larger vocabulary. Our results demonstrate that PIXAR achieves a relative improvement of 31.0% in MRR@10 on MS MARCO and 23.2% in Hits@5 on Natural Questions compared to standard NAR models with similar latency and cost.

nar model, pixar, target vocabulary, (13 more...)

arXiv.org Artificial Intelligence

2406.06739

Country:

North America > United States > Iowa > Polk County > Des Moines (0.05)
Asia > India (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
(2 more...)

Add feedback

Smart Textile-Driven Soft Spine Exosuit for Lifting Tasks in Industrial Applications

Zhu, Kefan, Sharma, Bibhu, Phan, Phuoc Thien, Davies, James, Thai, Mai Thanh, Hoang, Trung Thien, Nguyen, Chi Cong, Ji, Adrienne, Nicotra, Emanuele, Lovell, Nigel H., Do, Thanh Nho

arXiv.org Artificial IntelligenceFeb-3-2024

Work related musculoskeletal disorders (WMSDs) are often caused by repetitive lifting, making them a significant concern in occupational health. Although wearable assist devices have become the norm for mitigating the risk of back pain, most spinal assist devices still possess a partially rigid structure that impacts the user comfort and flexibility. This paper addresses this issue by presenting a smart textile actuated spine assistance robotic exosuit (SARE), which can conform to the back seamlessly without impeding the user movement and is incredibly lightweight. The SARE can assist the human erector spinae to complete any action with virtually infinite degrees of freedom. To detect the strain on the spine and to control the smart textile automatically, a soft knitting sensor which utilizes fluid pressure as sensing element is used. The new device is validated experimentally with human subjects where it reduces peak electromyography (EMG) signals of lumbar erector spinae by around 32 percent in loaded and around 22 percent in unloaded conditions. Moreover, the integrated EMG decreased by around 24.2 percent under loaded condition and around 23.6 percent under unloaded condition. In summary, the artificial muscle wearable device represents an anatomical solution to reduce the risk of muscle strain, metabolic energy cost and back pain associated with repetitive lifting tasks.

artificial muscle, lifting, lifting task, (17 more...)

arXiv.org Artificial Intelligence

2402.02319

Country:

Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Iowa > Polk County > Des Moines (0.04)
North America > United States > Illinois > DuPage County > Elmhurst (0.04)
Asia > Vietnam > Hanoi > Hanoi (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Therapeutic Area > Musculoskeletal (1.00)

Technology:

Information Technology > Human Computer Interaction > Interfaces (0.68)
Information Technology > Artificial Intelligence > Robots (0.68)

Add feedback

Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization

Cheang, Chi Seng, Chan, Hou Pong, Wong, Derek F., Liu, Xuebo, Li, Zhaocong, Sun, Yanming, Liu, Shudong, Chao, Lidia S.

arXiv.org Artificial IntelligenceNov-2-2023

Recent pre-trained language models (PLMs) achieve promising results in existing abstractive summarization datasets. However, existing summarization benchmarks overlap in time with the standard pre-training corpora and finetuning datasets. Hence, the strong performance of PLMs may rely on the parametric knowledge that is memorized during pre-training and fine-tuning. Moreover, the knowledge memorized by PLMs may quickly become outdated, which affects the generalization performance of PLMs on future data. In this work, we propose TempoSum, a novel benchmark that contains data samples from 2010 to 2022, to understand the temporal generalization ability of abstractive summarization models. Through extensive human evaluation, we show that parametric knowledge stored in summarization models significantly affects the faithfulness of the generated summaries on future data. Moreover, existing faithfulness enhancement methods cannot reliably improve the faithfulness of summarization models on future data. Finally, we discuss several recommendations to the research community on how to evaluate and improve the temporal generalization capability of text summarization models.

computational linguistic, dataset, knowledge, (14 more...)

arXiv.org Artificial Intelligence

2305.01951

Country:

Europe > United Kingdom (0.47)
Asia > Macao (0.05)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry: Government > Regional Government > North America Government > United States Government (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback